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Abstract: In this paper we consider a group sequentially monitored trial 
on a survival endpoint, monitored using a weighted log-rank (WLR) statis- 
tic with deterministic weight function. We introduce a summary statistic in 
the form of a weighted average logged relative risk and show that if there 
is no sign change in the instantaneous logged relative risk, there always ex- 
ists a bijection between the WLR statistic and the weighted average logged 
relative risk. We show that this bijection can be consistently estimated at 
each analysis under a suitable shape assumption, for which we have listed 
two possibilities. We indicate how to derive a design-adjusted p-value and 
confidence interval and suggest how to apply the bias-correction method. 
Finally, we document several decisions made in the design of the NLST 
interim analysis plan and in reporting its results on the primary endpoint. 
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1. Introduction 

Time to event, e.g. disease specific mortality, is the primary endpoint in many 
clinical trials. The use of group sequential boundaries in monitoring the trial is 
not only commonplace, but ethically mandated in all trials of human subjects. 
The logrank statistic is often the monitoring statistic of choice due to its natural 
connection with the relative risk, which is often the parameter of inference. This 
natural connection, which is based upon the assumption of proportional hazards, 
admits a one-to-one correspondence between the inferential procedure based 
upon the usual standard normal scale and that based on the scale of the natural 
parameter. However, the assumption of proportional hazards is not always a 
reasonable assumption. In many subject areas, e.g. in disease-prevention trials, 
one expects that the hazard ratio will not be constant. Much of the prior work 
on the use of the weighted logrank statistic in a sequential design is confined 
to the use a weighting function from the G pn (t) = S p (t)(l — S(t)) 7 family, of 
Fleming and Harrington, [2]. They suggest two major types of problems which 
can arise. First, they argue that use of the weighted logrank statistic does not 
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reproduce the single point analysis in the way that is desired. Most notably, 
they argue, there is no clinically meaningful parameter that allows the values 
of the monitoring statistic and sequential boundaries to be cast into a clinically 
meaningful scale. They believe that this problem is further aggrivated when the 
range of the weighting function over the duration of the trial is quite large, such 
as is the case with the G 01 weight function (Gillen and Emerson, [4]) and suggest 
a re- weighting scheme whereby the most weight is given to the most recent data 
collected at each analysis. Secondly, they argue that if the chosen weighting 
function is non-deterministic or trial-specific then it is impossible to compare 
results from different clinical trials, (Gillen and Emerson, [3, 5]). While the bulk 
of these cautious remarks arc useful to know in their own right, several important 
points have been omitted from the discussion. Firstly, as we will show, there is 
a natural, clinically meaningful parameter, the weighted average logged relative 
risk, that is connected bijectively to the weighted logrank statistic when there 
is no change in sign in the instantaneous logged relative risk. Under suitable 
shape assumptions, the bijection can be estimated at each analysis. We will 
show that the asymptotic distribution of the WLR statistic, suitably normalized 
is a Brownian motion plus drift under nothing but boundeness conditions. In 
two corollaries, we demonstrate how each of two presented shape assumptions 
translates into a form of the drift function and consequently, into an estimator of 
the weighted average logged relative risk. We then demonstrate how the usual 
results concerning monitoring and end of trial estimation follow. Finally, we 
note that this bijection between the weighted logrank statistic and the weighted 
average logged relative risk allows the values of the monitoring statistic, efficacy 
and futility boundaries, and reported point estimate and confidence interval to 
be cast into a clinically meaningful scale. 

2. Terminology and framework 

We consider a two armed randomized trial of the effect of an intervention upon 
a time to event that is run until time r. Let Tj be the possibly unobserved 
time to event and let Ci a right censoring time. We assume non-informative 
censoring for simplicity. Let Tj = Xj A C, be the observed time on study and 
let Si = I(Ti < Ci) be the event indicator. Let Xi indicates membership in the 
intervention arm (Xj = 1) or control arm (Xi = 0). We assume, conditional upon 
Xi, that individuals, i = 1, . . . ,n are distributed independently and identically. 
Let dH$(t) and dH\(t) be the trial arm specific cumulative hazard increments. 
We assume throughout that H (t) is finite for all t on [0, r]. For the instantaneous 
logged hazard ratio, we write 



Let Ni(t) = I(Ti < t,5 t = 1) and dN^t) = N,(t) - N^t-) be the subject 
level counting process and its increments, respectively. Let N n (i) = ^2iNi(t) 




(2.1) 
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and dN n (t) — N n (t) — N n (t—) be the aggregated counting process and its incre- 
ments, respectively. Note that the following difference is a compensated counting 
process martingale: 

dMi(t) = dNi{t) - I(Ti > t) exp(X J /3(i))d J ff (i) (2.2) 

Let E n (t, 0) = Y.i x iI{ T i > t)/ Ei I ( T i > *) denote the proportion of the popu- 
lation at risk at time t in the intervention arm, and let e(i, 0) = lim a . s . n ^-oo E n {t, 0) 
and let G{t) = linw dN n (t)/n. Let F n (t) = J* E n (£,0)(l-E n (£,0))dN n (£)/n 
and let IF(t) = J* e(£, 0)(1 — e(£, 0)) dG(£_). We introduce the following notation 
for cross moment integrals against dIF over (0,i): 

(^|JF|V 2 )t = f MO M0dF(0 . (2.3) 



For reasons that will become clear below, we consider the target of our investi- 
gation to be the following weighted average logged relative risk: 

* (» (24) 

Let q(t) = f3(t)//3*. This provides a representaton of the instantaneous logged 
relative risk function, j3(i) = (3* q(t) as the product of its weighted average 
value, /3* times a shape function, q. Note it follows that the shape function has 
weighted average value equal to 1: 

_ (Q\IF\q) T 

Jq\W\r^' (2 - 5) 

At follow-up time t, the y/n normalized score statistic with weighting function 
Q is: 

1 " t* 

U n (t) = -=J2 Q(0{Xi-E n ^,0)}dNi(0- (2-6) 

Its estimated variance is: 
1 f* 

V n (t) = - Q 2 (OE n (Z,0)(l-E n (Z,0))dN n (0 = (Q\IF n \Q) t . (2.7) 



Let v(t) = lim . s . V n (t). Note that v(t) = {Q\JF\Q) t . Let f n {t;r) = V n (t)/V n (r) 
and f{t;r) = v(t)/v(r). We will on occasion use the shorthand f n j and fj 
for fn(t;r) and /(t;r), respectively. Also, let m n (t) = (Q\JF n \Q)t and m(i) = 
(Q\JF\Q) t - We consider the weighted log-rank (WLR) statistic at time t on 
several "scales" 



(i) The standard normal scale: Z n (t) = U n (t) j \JV n {t) 

(ii) The "Brownian scale": X n (t) = U n (t)/y/vJ7j 
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3. Main Result 

Condition 3.1. The instantaneous logged relative risk function, (3, is bounded 
on [0, t]. 

Condition 3.2. The chosen weighting function, Q, is bounded on [0, r] and 
deterministic. 

Recall that a weighting functions is always non-negative. The stipulated 
boundedness in conditions 3.1 and 3.2 above can be relaxed to being of class L 2 
with respect to the measure dlF, as this is all that is really required. 
While the context will involve monitoring the statistic at a sequence of interim 
analyses, for the time being, we suppress this aspect and consider instead the 
following more general and generic result which holds under the weakest set of 
assumptions: 

Theorem 3.1. Under conditions 3.1 and 3.2, then under the family of local 
alternatives, /?* = b* j ' \fn, the score statistic, normalized to the "Brownian scale" 
is asymptotically a Brownian motion on [0, 1] plus a drift. 

X n (t)^W(f(t; T ))+»(t) (3.1) 

where the "time scale" for the Brownian motion is the variance ratio or infor- 
mation fraction, f(t;r) — v(t)jv(r), and the drift, parameterized by t is 

H(t) = W£M± 6*. (3.2) 
y/{QW\Q)r 

The proof of 3.1 is given in appendix 8.1. Notice, first, that from equations 
2.5 and 3.2, it follows that the value of the drift function at the scheduled end 
of the trial is 

Mr)= <Q\mi)r ^ (33) 

V(Q\f\Q)t 

Thus, without any additional assumptions on the shape function, q, we have the 
following corollary: 

Corallary 3.1. At the planned conclusion of the trial, t, an estimate of ft* is 
given by the following: 



y/n (Q|JF„|1) T 



(i) j3* is unbiased 

(ii) An estimate of its variance is given by 



var 



_ (Q\F n \Q) T 



G. Izmirlian/ Estimation following group sequential trial 



5 



4. Estimates of (3* in a Trial Stopped Early 

Obtaining an estimate of /3* at a trial stopped early due to an efficacy boundary 
crossing will require more assumptions on the shape function, q. At a minimum 
in order to have a monotone drift function which is necessary for propper mon- 
itoring, we require the following. 

Condition 4.1. The shape function, q, is non-negative. 

Since the drift's function's dependence on t is through an integral of a non- 
negative function, we have the following corollary: 

Corallary 4.1. // conditions 3.1, 3.2 and J^.l are true then the conclusion of 
theorem 3.1 holds and the drift function is monotone increasing or decreasing 
in t, depending upon the sign of b* . 

Note also that as the inverse of an increasing function is also increasing, 
the drift function can also be considered a monotone function of the informa- 
tion fraction. This would, of course, lead to a natural estimate of (3* in a trial 
stopped early except for the fact that we have no knowledge of q. In order to 
have a more useful estimator for f3* in trials stopped early, we opt for a semi- 
parametric model. In the following, we list two possibilities. The most natural 
shape condition to impose is true if our choice of weight function was the optimal 
one among all possible choices. 

Condition 4.2. The shape function, q, is proportional to our chosen weighting 
function, q(t) ~ K Q(t). 

Note that as the weighted average of the shape function must equal 1 as in 
equation 2.5 it follows that the constant of proportionality, K, must be 

K= g>I. (4.1) 

Corallary 4.2. // conditions 3.1, 3.2 and 4-2 are true then 

(i) X n is asymptotically a Brownian motion with a drift that is linear in the 
information fraction: 

«t)= j Q|jF|1) - mr)V. (4.2) 

(ii) If the trial is stopped at an analysis number J at calender time tj due to 
an effacacy boundary crossing, then we have the following estimate of (3* 



X n (tj) y/ {Q\F n \Q) T 

' fn(tj;r) ^(Q|JF„|1) T 1 • > 



(Hi) An estimate of the mean-squared error is given by: 

{QWn\Q)r 



nf n (tj;r) {Q\IF n \l)% 



(4.4) 
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Another natural shape condition is true when we have opted for a weighted 
statistic but the true shape is constant. 

Condition 4.3. The shape function, q, is identically 1. 

Corallary 4.3. // conditions 3.1, 3.2 and 4-3 are true then 

(i) X n is asymptotically a Brownian motion the following drift: 

where r{t;r) = (Q\lF\l)t/(Q\lF\l) T , which is an increasing function oft 
and takes the values at t = and 1 at t = r. 
(ii) If the trial is stopped at an analysis number J at calender at time tj due 
to an effacacy boundary crossing, then we have the following estimate of 

~ X n (tj) V(Q\JF n \Q) T 

P r„(tj;r) y/H {Q\F n \l) T ' 1 

where r„(t;r) = (Q\IF n \l) t /(Q\F n \l) T 
(Hi) An estimate of the mean-squared error is given by: 



fn(tr,T) (Q\IF n \Q) T 



nr n {tj-Tf {Q\IF n \l)l 



5. Application to Monitoring and Final Reporting in a Clinical Trial 

The relationship between the drift of the WLR statistic and the weighted average 
logged relative risk parameter provided by theorem 3.1 and its corallaries can 
be used in the monitoring and final reporting of a clinical trial. 



5.1. Futility Boundary 



Our comments regarding monitoring a trial are made within the context of 
boundaries constructed using the Lan-Dcmcts procedure, [6]. Construction of 
the efficacy boundary is done under the null hypothesis that the drift function is 
identically zero and can be done without appealing to the results presented here. 
If a futility boundary is specified in the design then under either of the shape 
assumptions, one can apply the corresponding corollary 4.2 or corollary 4.3 to 
calculate the drift function at each interim analysis which is required to compute 
the futility boundary under the Lan-Dcmets approach [6]. Note that the shape 
assumption being made must be part of the interim analysis plan design. In the 
following discussion we will assume that the optimal weighting shape condition 
4.2 was specified in the design so that the discussion focuses on the application 
of corollary 4.2. In this case, (5* is the weighted average logged relative risk for 
which the study is powered to detect and must also be specified in the interim 
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analysis plan design. The values of v(t) = {Q\IF\Q) T and m(r) = (Q|-ZF|l) r at 
the planned termination of the study, r, must also be specified in the interim 
analysis plan design. We demonstrate in appendix 8.2 when the only source 
of censoring is administrative censoring or other cause mortality, how these 
functionals can be projected for a specific choice of weighting function, Q, based 
upon projected values of the cross-arm pooled cumulative hazard function at 
several landmark times on study. We remark here that following consensus, we 
recommend using a non-binding futility boundary which is constructed after 
construction of an efficacy boundary which ignores the existence of the futility 
boundary. This is preferred to the joint construction of efficacy and futility 
boundaries as that approach results in a discounted efficacy criterion. 



5.2. Prediction at End of Trial 

When the trial is stopped at an efficacy or futility boundary crossing, or at the 
scheduled end of the trial, and if the optimal weighting shape assumption 4.2 
was specified in the design, then corollary 4.2 can be used to convert the value of 
the WLR statistic on the Brownian scale, X n (tj), to an estimate of the weighted 
average logged relative risk, /3*. Therefore, our point estimate is 

? X n (t 3 ) V(Q\IF n \Q)r . . 

1 f nJ v^(Q|iF„|l} T 1 ' ; 

We use the values of v(t) = (Q\IF\Q) T and m(r) = (Q\JF\1) T which are spec- 
ified in the interim analysis plan design. As mentioned above, when it is ob- 
tained at an efficacy boundary crossing, these type of estimates are known to 
be biased away from the null (see e.g. Liu and Hall, [7]). The construction of 
a design-adjusted confidence interval and adjustment of this estimate for the 
above mentioned bias are standard results, especially under the optimal weight- 
ing shape condition 4.2 which leads, in corollary 4.2, to a drift that is linear 
in the information fraction. For sake of completeness, we outline below how to 
compute a design adjusted p-value, construct a design-adjusted confidence in- 
terval and how to calculate the bias adjusted estimate of the weighted average 
logged relative risk. All three of these tasks involve the sampling density under 
the null hypothesis of the sufficient statistic, (J, X n (tj)), where J and X n {tj) 
are the analysis number and the value of the weighted logrank statistic at an 
efficacy crossing. The sampling density of (J, X n (tj)) takes the following form. 
First, for j = 1, 7r((l,x)) = TP{X n (h) = x}. For j > I, 

w((j,x) ; bx-y-x^fiy) (5.2) 
= ^HoW = j and X n (t e ) < y/Teh , t = 1, . . . ,j - 1, X n {t 3 ) = x] 

Here b 1 . ^_ 1 - ) is the sequence of efficacy boundary points at all prior analyses and 
is the sequence of information fractions at all analyses prior and current. In 
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the following h\-j and i\ j for £ < I denote the empty sequence. The construction 
and form of this density is reviewed in appendix 8.3. Let 

/•OO 

R((j,x);'h- L . u _ rj ,fi :j ) = / 7r((i,0;bi:(j_i),fi :j )^ (5.3) 

J x 

be the joint probability under w that J = j and X n (tj) is in the right tail 
(a;, oo). In order to calculate a p- value and construct a confidence interval which 
account for the sequential design, we must choose an ordering of the sample 
space for the statistic (J, X n {tj)). Here we prefer to use the following ordering: 
(j, x) > (k,y) if and only if (j = k and x > y) or j < k. This ordering is 
applicable when the rejection region is convex, as is the case with Lan-Demets 
boundaries constructed using a smooth spending function. The discussion of 
the p- value and of the confidence interval is in the setting of symmetric 2-sidcd 
boundaries and when sign of the alternative hypothesis is positive as it is a 
simple matter to apply these results to the case where the sign of the alternative 
hypothesise is negative. 



P-value 

Under the ordering given above, the region further away from the null than 
(J, X n (tj)) is the union of all prior rejection regions with the right tail at X n (tj). 
Thus the design-adjusted or sequential p-value is: 

j-i 

fl(( J, X n (tj)); b!^!) , f UJ ) + 3((4 bi); b w _ x , f iu ) , (5.4) 



Confidence Interval 

If the probability of type one error that remained prior to anah/sis J is a tot — 
then a two sided design-adjusted confidence interval for /3* is derived as 
follows. If we denote by x u the solution in x of the equation 



j-i 



atot - otj-i = n((J, x); bi : (j_i), f i:J ) + ^ Il((£, bt); bi^-i, fi^) , (5.5) 



then the design-adjusted confidence interval is 



mse 



0* 



(5.6) 



where mse 



P* is the estimated mean-squared error of P* as given in part (iii) 



of corollary 4.2. Note that when the efficacy boundary is one-sided one can still 
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construct a 2-sided confidence interval by replacing a to t — ozj-i above with 1/2 
its value. 



Bias Adjustment 

As in Liu and Hall, [7], bias adjustment is done recursively as follows. First, 



C(M) = f (5-7) 



Continuing, 



C0»= / C(i-l,e)vr((i-l,0;bi:( j -i),fi:0--i))^ A .(a ; -C)^ (5.8) 

J — oo 

The bias adjusted estimate, ft*, of the weighted average logged relative risk, 
ft*, is obtained by replacing X n (tj)/f n ,j in part (ii) of corollary 4.2 with 
C,(J, X n (tj)) to obtain the following: 



p = aj,x n ( tJ )) ffi'^'ff; (5.9) 

The design-adjusted confidence interval is the same as given above, but now 
centered about ft* 



ft* 



(5.10) 



6. The NLST 



The design of the National Lung Screening Trial (NLST) [8] interim analysis 
plan stipulated a one-sided efficacy boundary constructed using the Lan-Demets 
procedure with a total probability of type one error set to 0.05. The trial had 
90% power to detect a relative risk of 0.79 at a sample size of 25,000 per arm, 
accounting for contamination and non-compliance that could attenuate this ef- 
fect to 0.85. The trial began randomization on August 5th, 2002 and concluded 
randomization on April 26th, 2004. A non-binding futility boundary was used. 
The drift was derived under the optimal weighting shape assumption, 4.2, and 
incorporated the design alternative ft* = log(0.85). Initial estimates of v(t) and 
m(r) were posed in the design. These were updated by using a least squares 
quadratic curve to project required future values of H as data accumulated. 
During the run of the trial, projected values of the end of trial functionals v(t) 
and m(r) did not vary more than ±5%. Interim analyses occured starting in 
Spring of 2006 and continued annually until the 5th analysis. The 6th analysis 
occured 6 months after the 5th. Data on the primary endpoint was backdated 
roughly 18 months to allow more complete ascertainment by the endpoint veri- 
fication team. The efficacy boundary was crossed at the sixth interim analysis, 
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using data backdated to January 15th 2009. Data on the primary endpoint was 
collected only for events occurring through December 31, 2009 so this was used 
as the scheduled termination date. The raw estimated weighted logged relative 
risk and its design-adjusted confidence interval were derived. The bias adjusted 
weighted logged relative risk was compared to the raw estimate. As the raw 
estimate is asymptotically unbiased, and since the crude risk ratio is the most 
straightforward and tangible summary of the trial results, the trial leadership 
decided to report the crude risk ratio together with the exponentiated raw esti- 
mate's design-adjusted confidence interval. 

7. Discussion 

We have shown that there is a natural clinically meaningful parameter, the 
weighted average logged relative risk, that is connected the weighted logrank 
statistic. When (3(t) does not change sign, the connection is a bijection. We 
have shown that under suitable shape assumptions, this bijection can be esti- 
mated at each analysis. We have shown how this bijection between the weighted 
logrank statistic and the weighted average logged relative risk allows the values 
of the monitoring statistic, efficacy and futility boundaries, and reported point 
estimate and confidence interval to be cast into a clinically meaningful scale. We 
have indicated how to derive a design-adjusted p- value and confidence interval 
and how bias adjustment of the estimate may be done using known methods. 
Finally, we have documented several decisions made in the design of the NLST 
interim analysis plan and in reporting its results on the primary endpoint. 
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8. Appendices 

8.1. Proof of Theorem 3.1 

We follow the usual method of adding and subtracting the differential of the 
compensator, and thereby express U n as a sum of a term that is asymptotically 
mean zero Gaussian process and a drift function which grows as y/n. 

U n (t) = -j=Y, Q(®{Xi-E n (t,0)}dMi{0 
V« . =1 J 

i " r* 

+ / Q(O{X l -E n (^0)}I(T. l >OeMX i q(OndH o (O 

V^ti-Zo 

I n rt 

+ f Q(0{En^n-E n (^0)}R n (^ndHo(0, (8-1) 
Jo 

where in the above, Rn(Z,P*) = 1/™E^( T * > exp(X;g(£),S*), and = 
l/(nR n (t P*)) XiHTi > exp(J5Qg(£)/3*). 

By linearizing the difference, E n (£, /3*) — E n (£, 0) about j3* = we obtain 



u n (t) = -ivfQtoiii-^o)}^) 



Jo 

(8.2) 

We normalize by \JV n (t) and replace the differential i? n (£, /3*)eL£fo(£) with 
dN n (£)/n. The latter is possible because integrals of bounded functions against 
the difference of the differentials arc consistent to zero. 

X n {t) = J_ J2 f Q(0 {Xi - 0)} dMi(0 

V n {T) i=1 Jo 



/ (3(0 9 (^(P){i-£»((,o)} 

* / n( T ) JO 

(Q|^n|?>i 



div„(e) 



- w " (Mur)) + <8 - 3) 
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The first term is easily recognized to be asymptotic in distribution to a standard 
Brownian motion. The reader can either directly apply Robolledo's martingale 
central limit theorem, verifying that in the case that integrands and intensities 
are bounded all conditions are satisfied, or apply a more direct result, such as 
theorem (6.2.1) in Fleming and Harrington [2]. Under the family of local alter- 
natives, /3* = b* I y/n, then by the comments following expression 8.2, the second 
term is easily seen to be consistent to the drift function listed in expression 3.2. 
Therefore the result follows by Slutzky's theorem. 



8.2. End of Trial Functionals 

In this section we demonstrate how to project values of the variance v(t) = 
(Q\IF\Q) T , and the "first moment" m(r) = (Q|2F|1) T at the scheduled end 
of study, r. This is done in the specific case of the "ramp plateau" weighting 
function which was used for interim monitoring and reporting in the NLST. 
This is the function which takes the value at t = 0, has linear increase to the 
value 1 at t = t c and then maintains this constant value forward. 

Q(t) = ~ A 1 (8.4) 

In the NLST, the value of t c = 4 years was used. Next, by imposing some mild 
assumptions we will be able to express all quantities in the integrands in terms 
of the cross-arm pooled cancer mortality cumulative hazard function, H and 
thereby solve the integrals via a simple change of variables. The resulting ex- 
pressions require only values of H(t) at t = t c , t = r — t er and t = r, where 
t er is the calender time at which randomization was concluded. First we shall 
list the required assumptions. In the following discussion, S, Si r and S Q th are 
survival functions corresponding to the cross-arm pooled cancer mortality, ad- 
ministrative censoring or "live removal" and other cause mortality. The latter 
two were the only sources of censoring in the NLST because complete ascertain- 
ment with respect to mortality was possibly through the use of the matching 
death certificates through the national death index. 

Condition 8.1. Other cause mortality is proportional to cancer mortality, i.e. 
that 9 = —dlog(S th)/dH is constant. 

Condition 8.2. Proportional allocation: e(£, 0) = e(0,0). 

Condition 8.3. Accrual is uniform on the scale of H, so that 

where r is the time at which the required number of events are obtained, and t er 
is the time at which randomization is completed. 



Condition 8.4. 



t c 1 - exp(-if (i c )) 
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The other cause versus cancer proportionality assumption is perhaps the most 
arguable. However, the extent to which it is violated in practice has little impact 
upon our results as other cause mortality enters our results only through its sur- 
vival function which maintains a value in excess of 0.95 throughout the trial. 
The proportional allocation assumption approximates what we see in practice 
quite closely, especially in the case of a large trial of a rare event. In the NLST 
there was 1 to 1 randomization so that e(0, 0) = 1/2. The extent to which the 
latter two assumptions 8.3 and 8.4 hold both depend upon the extent to which 
pooled cancer specific mortality grows at a constant rate. In the case of the 
NLST, the pooled cancer mortality cumulative hazard function did grow at an 
approximately linear rate. 

Variance at Planned Termination 



Here, S, Si r and S th are survival functions corresponding to the cross-arm 
pooled cancer mortality, administrative censoring or "live removal" and other 
cause mortality. The latter two were the only sources of censoring in the NLST 
because complete ascertainment with respect to mortality was possibly through 
the use of the matching death certificates through the national death index. 
Therefore, we can express the differential, dG, in this way. Under assumptions 
8.1, 8.2, 8.3, and 8.4, we apply the change of variables, rj = to obtain 





Q 2 (£)e(£, 0) (1 - e(£, 0)) S oth (0Si r (0S(0dH(t) . (8.7) 






+ 



A{H{T)-H{T-t er )) 7 H(r _ tcr) 



7(r - t er < t c ) 



H(t c ) 

(1 - 2c-'" + c- 2 ") e - {e+1 ^ (H(t) - ?/) dr) 




h+h+h + h- 
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These evaluate to: 

1(1- c-^+^Hm 1 _ c -(9+2)H m l _ c -(9+3)H m -I 

h = - { 2 1 } where H m = H(t c ) A H(t - t e 

2 e (e+i)ff(t c ) _ c -(e+i)H(r-u r ) 



4(0+1) 



i(t c <r-t er ) (i- e - H ^y 

J(t - t er < t c ) 
4(JT(t) - ff(r - i er )) 

' e -(e+l)H(r-t er ) e -(e+2)H(r-t er ) e _(e+3)J?(T-t er ) 

+ 1 2 + 2 + + 3 



(H( T )-H(T~t er )) 



-(0+l)H(t a ) -{6+2)H{t c ) ( S +3)H(t c )* 



6 + 1 6 + 2 6 + 3 

e -(6+l)H( T -t er ) _ e -(9+l)H(t c ) e -(0+2)H(T-t er ) _ e -(e+2)J?(t c ) 

(0+T) 5 2 (0T2P 

e -(e+3)H(r-t er ) _ e -(9+3)ff(t c ) 



(6 + sy 2 



(l_e-»(tc)) 
4(0+1) 



2 



ff (r) - (H(r - W) V (t c )) , fl+lKjr , r _ t , e^ 1 )^"^^)) - e"^ 1 )^ 



H(r)-H(r-t er ) (9 + l)(F(r) - H(t - t e 

respectively. 

First Moment at Planned Termination 

m{r) = [ Q(Oe(e,0)(l-e(e,0))dG(e) 



Q(£M6 °) (1 - °)) Sot h (0Sir(0S(®dH(0 . (8.8) 
Under assumptions 8.1, 8.2, 8.3, and 8.4, we again apply the change of variables, 
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?/ = H(£), to obtain 

•»« - \r ('--' Ag "°') { g( /- T i(7-M "} ^« 

, r H(t c )AH{r-t er ) 

= -J (l-e"") e-^e-^dr) 

+ -I{t c <T-t er ) (l-e- H ^>) / e-^e-^r? 
4 v y 



+ I ,fc > r - <„, (1 - e-,) e- % (T) g _%- r l tgr) e-^ 

+ i/fe < r) (l - .-»•>) [""' g(r ';" , . 

4 V ' JH(t c )VH(T-t cr ) H \ T ) - H K T -ter) 

= J I + J 2 + h + Ji 

These evaluate to 

1 f 1 - e-( 0+1 ) (H(tc)AH(r-t er )) I _ e -(e+2)(H(t c )AH(r-t er )) 



6>+l 6> + 2 

J 2 = i/(t c <T-t er ) (l-e 
I(t c > t - t er ) 



1 ( n e -(e+i)ff(t c ) _ e -(0+l)ff(r-t er ) 

9 + 1 



■h = 



A{H{ T )~H{ T -t er )) 

' (H(t) - H(T - t er )) c^ 1 )^- *") - (H(t) - H(t c )) C -(9+l)^) 



9 + 1 

(ff (t) - H(T - t er )) c -(0+m{r-t, r ) _ ( g ( T ) _ g ( fc )) c -(9+2)g(t e ) 

6» + 2 

e -(e+l)J?(T-t er ) _ e -(0+l)H(t c ) e -{6+2)H{T-t eT ) _ e ~(9+2)H{t a ) 



Jx 



(9 + 1) 2 (9 + 2? 

I(t c<T ) (l-e- g ^)) 
i(H(r)-H(T-t er )) 



(H(T) - H(t c V (T - ter))) e-( e + 1 ) H ^ V ( T -«"-)) _ e -(e+l)g(teV(r-t er )) _ e -(fl+l)g(r) 

e~Ti {9 + 1? 



respectively. 
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Duration of Trial 

The duration the NLST was part of the design. In other situations in which the 
design stipulates that the trial should run until required number of events is 
attained, the above change of variables technique can be used to find a closed 
form expression for 



in terms of the projected values of H at t = r and t = r — t er . Then using the 
plug-in estimate TEN n (r)/n for G(t) this expression can be inverted to solve for 
r, the duration of the trial. 

8.3. Sampling density of (J, X n (tj)) 

As in Armitagc, McPherson and Rowe, [1], the sampling density of (J, X n (tj)) 
can be derived recursively as follows. Let Aj = f n j — f n ,j-i and let 4> v {x) = 
4>(x l 1 \pv) l 'y/v where </> is the density of the standard normal. First, 




(8.9) 



tt((1,x)) = <(> (x). 



(8.10) 



Next, for all j > 1, 



7r(C?» ; bi:(j-_i),fl: 3 -) 




(8.11) 



